Tight Bounds on the Redundancy of Huffman Codes

نویسندگان

  • Soheil Mohajer
  • Payam Pakzad
  • Ali Kakhbod
چکیده

Consider a discrete finite source with N symbols, and with the probability distribution p := (u1, u2, . . . , uN). It is well-known that the Huffman encoding algorithm [1] provides an optimal prefix code for this source. A D-ary Huffman code is usually represented using a D-ary tree T , whose leaves correspond to the source symbols; The D edges emanating from each intermediate node of T are labeled with the D letters of the alphabet, and the codeword corresponding to a symbol is the string of labels on the path from the root to the corresponding leaf. Huffman’s algorithm is a recursive bottom-up construction of T , where at each time the smallest D probabilities are merged into a new unit, and henceforth represented by an intermediate node in the tree. Throughout this paper, unless D is explicitly specified, we talk about the binary Huffman codes. Denote by l(u) the length of the path from the root to a node u on the Huffman tree T . Then the expected length of the Huffman code is defined as

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Redundancy-Related Bounds on Generalized Huffman Codes

This paper presents new lower and upper bounds for the compression rate of optimal binary prefix codes on memoryless sources according to various nonlinear codeword length objectives. Like the most well-known redundancy bounds for minimum (arithmetic) average redundancy coding — Huffman coding — these are in terms of a form of entropy and/or the probability of the most probable input symbol. Th...

متن کامل

Tight Bounds on the Average Length, Entropy, and Redundancy of Anti-Uniform Huffman Codes

In this paper we consider the class of anti-uniform Huffman codes and derive tight lower and upper bounds on the average length, entropy, and redundancy of such codes in terms of the alphabet size of the source. The Fibonacci distributions are introduced which play a fundamental role in AUH codes. It is shown that such distributions maximize the average length and the entropy of the code for a ...

متن کامل

Bounds on Generalized Huffman Codes

New lower and upper bounds are obtained for the compression of optimal binary prefix codes according to various nonlinear codeword length objectives. Like the coding bounds for Huffman coding — which concern the traditional linear code objective of minimizing average codeword length — these are in terms of a form of entropy and the probability of the most probable input symbol. As in Huffman co...

متن کامل

A simple upper bound on the redundancy of Huffman codes

Upper bounds on the redundancy of Huffman codes have been extensively studied in the literature. Almost all of these bounds are in terms of the probability of either the most likely or the least likely source symbol. In this correspondence, we prove a simple upper bound in terms of the probability of any source symbol.

متن کامل

On the Redundancy of Optimum Fixed-to-Variable Length Codes

There has been much interest in recent years in bounds on the redundancy of Huffman codes, given only partial information about the source word distribution, such as the probability of the most likely source. This work determines upper and lower bounds for the redundancy of Huffman codes of source words which are binomially distributed. Since the complete distribution is known, it is possible t...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005